Families of Dendrograms
Author
Abstract
A conceptual framework for cluster analysis from the viewpoint of p-adic geometry is introduced by describing the space of all dendrograms for n datapoints and relating it to the moduli space of p-adic Riemannian spheres with punctures using a method recently applied by Murtagh (2004b). This method embeds a dendrogram as a subtree into the Bruhat-Tits tree associated to the p-adic numbers, and goes back to Cornelissen et al. (2001) in p-adic geometry. After explaining the definitions, the concept of classifiers is discussed in the context of moduli spaces, and upper bounds for the number of hidden vertices in dendrograms are given.
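The link between dendrograms and p-adic numbers rests on the p-adic distance being an ultrametric: any two points merge at a well-defined level, exactly as in a dendrogram. The following is a minimal sketch of that idea only, not the embedding construction of the paper. The integer labels 0 to 3, the choice p = 2, and the helper names `padic_valuation`, `padic_distance` and `is_ultrametric` are assumptions made for illustration.

```python
from itertools import combinations, permutations


def padic_valuation(n: int, p: int) -> int:
    """Return the largest k such that p**k divides the nonzero integer n."""
    if n == 0:
        raise ValueError("the p-adic valuation of 0 is infinite")
    n = abs(n)
    k = 0
    while n % p == 0:
        n //= p
        k += 1
    return k


def padic_distance(a: int, b: int, p: int) -> float:
    """Return |a - b|_p = p**(-v_p(a - b)), with distance 0 when a == b."""
    if a == b:
        return 0.0
    return float(p) ** (-padic_valuation(a - b, p))


def is_ultrametric(points, p: int) -> bool:
    """Check the strong triangle inequality d(x, z) <= max(d(x, y), d(y, z))."""
    return all(
        padic_distance(x, z, p)
        <= max(padic_distance(x, y, p), padic_distance(y, z, p))
        for x, y, z in permutations(points, 3)
    )


# Hypothetical encoding: four datapoints labelled by the integers 0..3, p = 2.
points = [0, 1, 2, 3]
p = 2

# Pairwise distances play the role of merge heights in a dendrogram.
distances = {(a, b): padic_distance(a, b, p) for a, b in combinations(points, 2)}

for pair, d in sorted(distances.items(), key=lambda item: item[1]):
    print(pair, d)
print("ultrametric:", is_ultrametric(points, p))
```

In this toy encoding the pairs (0, 2) and (1, 3) sit at distance 0.5 and every other pair at distance 1.0, so the corresponding dendrogram has two clusters {0, 2} and {1, 3} that merge at the top level.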
Related works
Degenerating Families of Dendrograms
Dendrograms used in data analysis are ultrametric spaces, hence objects of nonarchimedean geometry. It is known that there exist p-adic representations of dendrograms. Completed by a point at infinity, they can be viewed as subtrees of the Bruhat-Tits tree associated to the p-adic projective line. The implications are that certain moduli spaces known in algebraic geometry are in fact p-adic par...
arXiv:0707.4072v1 [stat.ML] 27 Jul 2007 FAMILIES OF DENDROGRAMS
A conceptual framework for cluster analysis from the viewpoint of p-adic geometry is introduced by describing the space of all dendrograms for n datapoints and relating it to the moduli space of p-adic Riemannian spheres with punctures using a method recently applied by Murtagh (2004b). This method embeds a dendrogram as a subtree into the Bruhat-Tits tree associated to the p-adic numbers, and ...
Pleuronectiformes species identification along the Iranian coastline of the Persian Gulf
Pleuronectiform fishes of the Persian Gulf coastlines along Khuzestan, Bushehr and Hormozgan provinces were studied morphometrically and meristically from April 2003 to September 2005 in order to identify species. In this study, 1551 fish samples were caught by trawl or collected from fish markets. Sampling was carried out seasonally in 27 regions. Thirty-six traits and parameters in...
PALI - a database of Phylogeny and ALIgnment of homologous protein structures
PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain struc...